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Abstract 

The  objective  of  this  project  is  to  provide  a  unifying  theoretical  and  computational  framework 
for  the  study  of  clique  relaxation  models  arising  in  biological  and  social  networks.  Increasing 
interest  in  studying  community  structures,  functional  modules,  or  clusters  in  complex  networks 
arising  in  biological  and  social  networks,  as  well  as  in  various  other  applications,  has  lead  to  a 
large  and  diverse  body  of  literature  introducing  numerous  graph-theoretic  models,  which  relax 
certain  characteristics  of  the  classical  clique  concept.  This  project  examines  the  elementary 
clique-defining  properties  inherently  exploited  in  the  available  clique  relaxation  models  and  pro¬ 
poses  a  taxonomic  framework  that  not  only  allows  to  classify  the  existing  models  in  a  systematic 
fashion,  but  also  yields  new  clique  relaxations  of  potential  practical  interest.  Based  on  the  pro¬ 
posed  taxonomy,  a  comprehensive  study  of  the  resulting  optimization  problems  is  carried  out, 
aiming  to  address  the  following  issues. 
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•  Systematically  studying  the  cohesiveness  properties  of  various  clique  relaxation  aiming  to 
assist  researchers  in  selecting  the  most  appropriate  model  for  a  particular  application  of 
interest. 

•  Exploring  the  fundamental  properties  of  the  clique  relaxation  models  of  interest  that  are 
responsible  for  the  computational  complexity  of  the  corresponding  optimization  problems, 
and  exploiting  these  properties  in  designing  appropriate  computational  tools  for  solving 
the  problems  in  question. 

•  Identifying  robust  clique  relaxation  structures  of  minimum  cost  that  are  of  practical  inter¬ 
est. 

The  applications  of  direct  interest  to  the  Air  Force  include  the  design,  management  and 
defense  of  complex  networks,  information  technology,  and  other  applications  as  described  in 
complex  networked  systems ,  robust  decision  making ,  and  socio-cultural  modeling  of  effective 
influence  discovery  challenge  thrusts. 


Summary  of  Research  Contributions 

The  main  research  contributions  resulting  from  the  project  are  summarized  below,  including  the 
list  of  papers  supported  by  this  project  that  have  been  finalized  and  are  published/accepted  for 
publication  or  submitted/ready  for  submission. 

The  unifying  framework  for  investigation  of  clique  relaxation  models  in  this  project  was  estab¬ 
lished  in  1 1 1 .  This  paper  analyzes  the  elementary  clique-defining  properties  implicitly  exploited  in 
the  available  clique  relaxation  models  and  proposes  a  taxonomic  framework  that  not  only  allows 
to  classify  the  existing  models  in  a  systematic  fashion,  but  also  yields  new  clique  relaxations  of 
potential  practical  interest.  According  to  the  proposed  taxonomy,  clique  relaxation  structures  were 
classified  into  regular /weak,  absolute/relative  models  that  either  restrict  a  violation  of  an  elemen¬ 
tary  clique  defining  property  or  enforce  a  fixed-size  clique  property  in  the  corresponding  definition. 
Higher-order  relaxations,  which  consider  more  than  one  elementary  clique  defining  property  simul¬ 
taneously,  have  also  been  introduced,  and  rigorous  bounds  on  the  cohesiveness  properties  guaran¬ 
teed  by  the  so-called  canonical  clique  relaxations  have  been  established.  With  respect  to  dynamic 
structural  properties,  the  analysis  focused  on  properties  related  to  the  concept  of  heredity.  It  was 
observed  that  the  presence  of  the  heredity  in  induced  subgraphs  property  is  essential  for  developing 
effective  combinatorial  branch-and-bound  strategies  for  detection  of  maximum  clique  relaxation 
structures.  The  concepts  of  weak  heredity,  quasi-heredity,  and  k- heredity  have  been  introduced 
and  their  algorithmic  implications  have  been  discussed.  These  developments  defined  the  further 
sequence  of  steps  taken  towards  achieving  the  project’s  objectives,  as  described  next. 

Hereditary  clique  relaxations 

Papers  |2j-[8]  study  hereditary  clique  relaxations ,  which,  in  addition  to  clique  itself,  include  s-plex, 
s-defective  clique,  s-bundle,  and  s-stable  cluster  models  arising  in  social  and  biological  networks. 
The  presence  of  heredity  allows  to  develop  effective  scale-reduction  procedures  and  exact  algorithms 
for  these  models  |2|.  As  a  result,  it  is  demonstrated  that  real-life  problem  instances  with  millions 
of  vertices  and  edges  can  be  solved  to  optimality  |3,4j.  Theoretical  explanation  of  such  a  success  in 
solving  very  large  instances  of  the  maximum  clique  problem,  which  is  notorious  for  its  intractability, 
is  provided  in  [5].  The  maximum  ratio  clique  problem  is  introduced  and  studied  for  the  first  time 
in  [6|.  The  fractional  objective  is  motivated  by  practical  consideration,  as  one  is  often  interested 
in  maximizing  benefit/cost  ratio  when  searching  for  a  cluster.  Easily  computable  analytical  lower 
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bounds  on  the  clique  number  are  studied  in  [7],  and  connections  of  the  maximum  clique  problem 
to  the  potential  energy  principles  in  networked  systems  are  discussed  in  [8|. 


Non-hereditary  clique  relaxations 


New  approaches  for  the  edge  density  based  quasi-clique  model,  which  does  not  posses  heredity  but 
is  quasi- hereditary,  are  developed  in  1 9 , 10  .  One  of  the  most  interesting  and  practically  important 


non-hereditary  clique  relaxations  is  the  distance-based  s-club  model,  which  is  defined  as  a  sub¬ 
set  of  nodes  inducing  a  subgraph  of  diameter  at  most  s.  Due  to  the  lack  of  heredity,  computing 
even  maximal  by  inclusion  s-clubs  is  an  NP-hard  problem.  A  survey  of  recent  progress  in  solv¬ 
ing  the  maximum  s-club  problem  and  its  weaker  counterpart,  the  maximum  s-clique  problem,  is 
An  exact  approach  for  the  maximum  s-club  problem  that  takes  advantage  of  the 

A  special  case  of  2-cliques  is  considered  in 


given  m 
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close  relation  to  s-cliques  is  proposed  in  12 
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where  a  0.5-approximation  algorithm  for  this  problem  in  unit-disk  graphs  is  proposed.  Dominating 
clubs,  modeling  low-diameter  virtual  backbones  in  networks,  are  studied  in  |14j.  Complexity  of 
gap  recognition  results  for  clubs  inspired  a  new  general  theoretical  framework  for  characterizing 
“provably  best”  construction  heuristics  for  hard  combinatorial  optimization  problems  [l5|.  A  new 
clique  relaxation  structure,  a-cluster,  based  on  the  concept  of  a  clustering  coefficient  is  introduced 
and  studied  in  the  context  of  social  networks  in 


16  . 


Robustness  considerations 

Several  approaches  are  used  to  address  robustness  considerations  in  this  project.  The  main  focus  is 
on  ensuring  connectivity,  which  is  perhaps  the  most  essential  requirement  that  can  be  imposed  on 
a  robust  network  cluster.  The  connected  subgraph  polytope  of  a  graph  is  studied  in  detail  in  17 


Approaches  for  finding  maximum  subgraphs  with  relatively  large  vertex  connectivity  are  proposed 
in  |l  S.  An  effective  integer  programming  approach  for  computing  fault-tolerant  connected  dom¬ 
inating  sets  is  developed  in  1 19 1 .  This  approach  performs  much  better  than  previously  published 
methods.  Minimization  version  of  k- core  and  related  problems  is  suggested  as  a  way  of  obtaining 
robust  clusters  and  tight  upper  bounds  on  the  clique  number  in  20  and  21  ,  respectively.  Fi¬ 


nally,  stochastic  risk-averse  versions  of  the  spanning  k- core,  maximum  clique,  and  maximum  2-club 
problems  are  investigated  in  1 22] ,  23],  and  24  ,  respectively. 
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Appendix:  Abstracts  of  Papers 

1.  J.  Pattillo,  N.  Youssef,  and  S.  Butenko.  On  clique  relaxation  models  in  network  analysis. 
European  Journal  of  Operational  Research  226  (2013),  9-18. 

Abstract 

Increasing  interest  in  studying  community  structures ,  or  clusters  in  complex  networks 
arising  in  various  applications  has  led  to  a  large  and  diverse  body  of  literature  introducing 
numerous  graph-theoretic  models  relaxing  certain  characteristics  of  the  classical  clique  con¬ 
cept.  This  paper  analyzes  the  elementary  clique-defining  properties  implicitly  exploited  in 
the  available  clique  relaxation  models  and  proposes  a  taxonomic  framework  that  not  only 
allows  to  classify  the  existing  models  in  a  systematic  fashion,  but  also  yields  new  clique 
relaxations  of  potential  practical  interest.  Some  basic  structural  properties  of  several  of 
the  considered  models  are  identified  that  may  facilitate  the  choice  of  methods  for  solving 
the  corresponding  optimization  problems.  In  addition,  bounds  describing  the  cohesiveness 
properties  of  different  clique  relaxation  structures  are  established,  and  practical  implica¬ 
tions  of  choosing  one  model  over  another  are  discussed. 

2.  S.  Trukhanov,  B.  Balasundaram,  S.  Butenko,  and  C.  Balasubramaniam.  Algorithms  for 
detecting  optimal  hereditary  structures  in  graphs,  with  application  to  clique  relaxations. 
Computational  Optimization  and  Applications  56  (2013),  113-130. 

Abstract 

The  general  node  deletion  problem  can  be  stated  as  follows:  Given  a  graph  property  II, 
find  the  minimum  number  of  nodes  whose  deletion  results  in  a  graph  satisfying  property  II. 

In  1978,  Yannakakis  established  that  if  the  property  II  is  hereditary  on  induced  subgraphs, 
nontrivial ,  and  interesting,  the  resulting  node  deletion  problem  is  NP-hard.  Several  well- 
known  graph  properties  such  as  requiring  the  graph  to  be  complete,  planar,  perfect,  and 
bipartite,  among  others,  meet  the  conditions  of  this  result,  highlighting  its  scope.  This 
paper  proposes  a  general  purpose  exact  algorithmic  framework  to  solve  such  NP-hard  node 
deletion  problems,  and  investigates  key  algorithm  design  and  implementation  issues  that 
will  help  tailor  the  general  framework  for  specific  graph  properties.  The  performance  of 
the  algorithms  so  derived  for  the  maximum  s-plex  problem,  and  the  maximum  s-defective 
clique  problem,  is  assessed  through  a  computational  study.  These  problems  are  used  in 
graph-based  data  mining  applications,  specifically  in  social  and  biological  network  analysis. 

3.  A.  Verma,  A.  Buchanan,  and  S.  Butenko.  Solving  the  maximum  clique  and  vertex  coloring 
problems  on  very  large  sparse  networks.  INFORMS  Journal  on  Computing  27  (2015),  164- 
177. 
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Abstract 


This  paper  explores  techniques  for  solving  the  maximum  clique  and  vertex  coloring 
problems  on  very  large-scale  real  life  networks.  Due  to  the  size  of  such  networks  and  the 
intractability  of  the  considered  problems,  previously  developed  exact  algorithms  may  not  be 
directly  applicable.  The  proposed  approaches  aim  to  reduce  the  network  instances  to  a  size 
that  is  tractable  for  existing  solvers,  while  preserving  optimality.  Two  clique  relaxation 
structures  are  exploited  for  this  purpose.  In  addition  to  the  known  fc- core  structure,  a 
newly  introduced  clique  relaxation,  k-community,  is  used  to  further  reduce  the  instance 
size.  Experimental  results  on  real  life  graphs  (collaboration  networks,  P2P  networks,  social 
networks,  etc.)  show  the  proposed  procedures  to  be  effective  by  finding,  for  the  first  time, 
exact  solutions  for  instances  with  over  18  million  vertices. 

4.  Z.  Miao  and  B.  Balasundaram.  Approaches  for  finding  cohesive  subgroups  in  large-scale  social 
networks  via  maximum  fc-plex  detection.  Under  review  in  Networks,  August  2014. 

Abstract 

A  fc-plex  is  a  clique  relaxation  introduced  in  social  network  analysis  to  model  cohesive 
social  subgroups.  Several  exact  algorithms  and  heuristic  approaches  to  find  a  maximum  size 
fc-plex  in  the  graph  have  been  developed  recently  for  this  NP-hard  problem.  This  article 
develops  a  basic  greedy  randomized  adaptive  search  procedure  (GRASP)  for  the  maximum 
fc-plex  problem,  with  a  key  improvement  in  the  design  of  the  construction  procedure  that 
alleviates  a  drawback  observed  in  multiple  past  studies.  In  existing  construction  heuristics, 
fc-plexes  found  for  smaller  values  of  parameter  k  are  sometimes  not  found  for  larger  k  even 
though  they  are  feasible,  and  inferior  solutions  are  found  instead.  We  identify  the  reasons 
behind  this  behavior  and  address  those  in  our  new  construction  procedure.  We  then  show 
that  an  existing  exact  algorithm  for  solving  this  problem  on  power-law  graphs  can  be 
considerably  enhanced  by  using  GRASP  and  “peeling,”  a  problem-specific  preprocessing 
procedure.  The  overall  approach  is  able  to  solve  the  problem  to  optimality  on  very  large- 
scale  social  networks  with  over  100,000  vertices  to  those  with  over  4.8  million  vertices. 
These  are  several  orders  of  magnitude  larger  than  the  largest  real-life  social  networks  on 
which  this  problem  has  been  solved  to  optimality  in  current  literature. 

5.  A.  Buchanan,  J.  L.  Walteros,  S.  Butenko,  and  P.  M.  Pardalos.  Solving  maximum  clique  in 
sparse  graphs:  an  0(nm  +  2d/4)  algorithm  for  d-degenerate  graphs.  Optimization  Letters  8 
(2014),  1611-1617. 


Abstract 

We  describe  an  algorithm  for  the  maximum  clique  problem  that  is  parameterized  by  the 
graph’s  degeneracy  d.  The  algorithm  runs  in  0(nm.  +  nTd)  time,  where  Td  is  the  time  to 
solve  the  maximum  clique  problem  in  an  arbitrary  graph  on  d  vertices.  The  best  bound  as 
of  now  is  Td  =  0(2d/4)  by  Robson.  This  shows  that  the  maximum  clique  problem  is  solvable 
in  0(nm)  time  in  graphs  for  which  d  <  41og2m  +  O(l).  The  analysis  of  the  algorithm’s 
runtime  is  simple;  the  algorithm  is  easy  to  implement  when  given  a  subroutine  for  solving 
maximum  clique  in  small  graphs;  it  is  easy  to  parallelize.  In  the  case  of  Bianconi-Marsili 
power-law  random  graphs,  it  runs  in  2°(v”)  time  with  high  probability.  We  extend  the 
approach  for  a  graph  invariant  based  on  common  neighbors,  generating  a  second  algorithm 
that  has  a  smaller  exponent  at  the  cost  of  a  larger  polynomial  factor. 

6.  S.  Sethuraman  and  S.  Butenko.  The  maximum  ratio  clique  problem.  Computational  Man¬ 
agement  Science  12  (2015),  197-218. 


Abstract 
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This  paper  introduces  a  fractional  version  of  the  classical  maximum  weight  clique 
problem,  the  maximum  ratio  clique  problem ,  which  is  to  find  a  maximal  clique  that  has 
the  largest  ratio  of  benefit  and  cost  weights  associated  with  the  clique’s  vertices.  NP- 
completeness  of  the  decision  version  of  the  problem  is  established,  and  three  solution  meth¬ 
ods  are  proposed.  The  results  of  numerical  experiments  with  standard  graph  instances,  as 
well  as  with  real-life  instances  arising  in  finance  and  energy  systems,  are  reported. 

7.  V.  Stozhkov,  G.  Pastukhov,  V.  Boginski,  and  E.  L.  Pasiliao.  New  analytical  lower  bounds  on 
the  clique  number  of  a  graph.  Submitted  October  2015. 

Abstract 

This  paper  proposes  three  new  analytical  lower  bounds  on  the  clique  number  of  a  graph 
and  compares  these  bounds  with  those  previously  established  in  the  literature.  Two  pro¬ 
posed  bounds  are  derived  from  the  well-known  Motzkin-Straus  quadratic  programming 
formulation  for  the  maximum  clique  problem.  Theoretical  results  on  the  comparison  of 
various  bounds  are  established.  Computational  experiments  were  performed  on  various 
random  graph  models  such  as  Erdos-Renyi  model  for  uniform  graphs  and  a  generalized 
graph  model  for  power-law  graphs  that  simulate  graphs  with  different  densities  and  as- 
sortativity  coefficients.  Computational  results  suggest  that  the  proposed  new  analytical 
bounds  improve  the  existing  ones  on  many  graph  instances. 

8.  A.  Veremyev,  V.  Boginski,  E.L.  Pasiliao.  Potential  energy  principles  in  networked  systems 
and  their  connections  to  optimization  problems  on  graphs.  Optimization  Letters  9  (2015) 
585-600. 


Abstract 

We  introduce  the  concepts  of  “gravitational”  and  “elastic”  potential  energy  in  the  con¬ 
text  of  networks  and  investigate  their  implications  on  interpreting  certain  well-known  graph- 
theoretic  concepts  and  optimization  problems.  In  the  case  of  gravitational  potential  energy, 
we  treat  the  nodes  of  a  graph  as  “particles”  with  masses  and  consider  the  potential  energy 
of  gravitational  interactions  between  them.  We  prove  that  the  maximum  clique  in  a  graph 
represents  the  minimum  gravitational  potential  energy  structure.  This  result  yields  a  nat¬ 
ural  physics-based  interpretation  of  the  well-  known  Motzkin-Strauss  formulation  for  the 
maximum  clique  problem.  In  the  case  of  elastic  potential  energy,  we  assume  that  graph 
nodes  are  located  on  an  elastic  surface,  graph  edges  represent  “springs”  connecting  the 
nodes  on  a  surface,  and  consider  potential  energy  corresponding  to  elastic  deformations  of 
springs.  We  show  that  under  certain  reasonable  assumptions  on  surface  deformation,  the 
second-smallest  and  the  largest  eigenvalues  of  the  graph  Laplacian  matrix  are  essentially 
the  minimum  and  the  maximum  elastic  potential  energy  of  a  graph,  respectively.  Moreover, 
the  associated  eigenvectors  correspond  to  node  heights  in  the  minimum  and  maximum  en¬ 
ergy  node  configurations.  Note  that  the  second-smallest  eigenvalue  of  the  graph  Laplacian 
is  widely  known  as  algebraic  connectivity,  and  it  turns  out  that  this  concept  also  has  a 
simple  and  intuitive  physics-based  interpretation. 

9.  Z.  Miao  and  B.  Balasundaram.  Lagrangian  dual  bounds  for  the  maximum  quasi-clique  prob¬ 
lem.  In  preparation.  Anticipated  submission,  December  2015. 

Abstract 

A  7-quasi-clique  in  a  simple  undirected  graph  refers  to  a  subset  of  vertices  that  induces 
a  subgraph  with  edge  density  at  least  7  G  [0, 1].  When  7  =  1,  this  definition  corresponds 
to  a  classical  clique.  When  7  <  1,  it  relaxes  the  requirement  of  all  possible  edges  in  the 
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clique  definition.  Quasi-clique  has  been  used  to  detect  dense  clusters  in  graph-based  data 
mining,  especially  in  large-scale,  error-prone  data  sets  in  which  clique  model  can  be  overly 
restrictive.  The  maximum  7-quasi-clique  problem  which  detects  a  maximum  cardinality 
7-quasi-clique  from  a  given  graph,  can  be  formulated  as  a  mathematical  program  with 
a  linear  objective  function  and  a  single  quadratic  constraint  in  binary  variables.  This 
article  investigates  the  Lagrangian  dual  of  the  formulation,  and  develops  an  upper  bounding 
technique  based  on  approximately  solving  the  Lagrangian  dual.  The  upper-bounds  based 
on  our  approach  is  significantly  tighter  in  comparison  to  the  current  state-of-the-art  mixed- 
integer  programming  solvers  employing  two  well-known  formulations  for  this  problem.  We 
establish  the  strength  of  the  relaxation  mathematically  and  via  numerical  experiments  on 
DIMACS  benchmark  instances. 

10.  F.  Mahdavi  Pajouh,  Z.  Miao,  and  B.  Balasundaram.  A  branch-and-bound  approach  for 
maximum  quasi-cliques.  Annals  of  Operations  Research  216  (2014),  145-161. 

Abstract 

Detecting  quasi-cliques  in  graphs  is  a  useful  tool  for  detecting  dense  clusters  in  graph- 
based  data  mining.  Particularly  in  large-scale  data  sets  that  are  error-prone,  cliques  are 
overly  restrictive  and  impractical.  Quasi-clique  detection  has  been  accomplished  using 
heuristic  approaches  in  various  applications  of  graph-based  data  mining  in  protein  inter¬ 
action  networks,  gene  co-expression  networks,  and  telecommunication  networks.  Quasi¬ 
cliques  are  not  hereditary,  in  the  sense  that  every  subset  of  a  quasi-clique  need  not  be  a 
quasi-clique.  This  lack  of  heredity  introduces  interesting  challenges  in  the  development  of 
exact  algorithms  to  detect  maximum  cardinality  quasi-cliques.  The  only  exact  approaches 
for  this  problem  are  limited  to  two  mixed  integer  programming  formulations  that  were 
recently  proposed  in  the  literature.  The  main  contribution  of  this  article  is  a  new  combi¬ 
natorial  branch-and-bound  algorithm  for  the  maximum  quasi-clique  problem. 

11.  S.  Shahinpour  and  S.  Butenko.  “Dinstance-based  clique  relaxations  in  networks:  s-cliques 
and  s-clubs”.  In:  Models,  Algorithms,  and  Technologies  for  Network  Analysis.  Ed.  by  B. 
I.  Goldengorin,  V.  A.  Kalyagin,  and  P.  M.  Pardalos.  Vol.  59.  Springer  Proceedings  in 
Mathematics  Statistics.  New  York:  Spinger  Science  +  Business  Media,  2013,  pp.  149-174. 

Abstract 

The  concept  of  the  clique,  originally  introduced  as  a  model  of  a  cohesive  subgroup  in 
the  context  of  social  network  analysis,  is  a  classical  model  of  a  cluster  in  networks.  How¬ 
ever,  the  ideal  cohesiveness  properties  guaranteed  by  the  clique  definition  put  limitations 
on  its  applicability  to  situations  where  enforcing  such  properties  is  unnecessary  or  even  pro¬ 
hibitive.  Motivated  by  practical  applications  of  diverse  origins,  numerous  clique  relaxation 
models,  which  are  obtained  by  relaxing  certain  properties  of  a  clique,  have  been  introduced 
and  studied  by  researchers  representing  different  fields.  Distance-based  clique  relaxations, 
which  replace  the  requirement  on  pairwise  distances  to  be  equal  to  1  in  a  clique  with  less 
restrictive  distance  bounds,  are  among  the  most  important  such  models.  This  chapter  sur¬ 
veys  the  up-to-date  progress  made  in  studying  two  common  distance-based  clique  relaxation 
models  called  s-clique  and  s-club,  as  well  as  the  corresponding  optimization  problems. 

12.  E.  Moradi  and  B.  Balasundaram.  Finding  a  maximum  /c-club  using  the  A:-clique  formulation 
and  canonical  hypercube  cuts.  Conditionally  accepted  at  Optimization  Letters,  October  2015. 

Abstract 

Detecting  low-diameter  clusters  is  an  important  graph-based  data  mining  technique 
used  in  social  network  analysis,  bioinformatics  and  text-mining.  Low  pairwise  distances 
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within  a  cluster  can  facilitate  fast  communication  or  good  reachability  between  vertices  in 
the  cluster.  Formally,  a  subgraph  of  diameter  at  most  k  is  called  a  fc-club.  For  low  values 
of  the  parameter  k,  this  model  offers  a  graph-theoretic  relaxation  of  the  clique  model  that 
formalizes  the  notion  of  a  low-diameter  cluster.  Using  a  combination  of  graph  decomposition 
and  model  decomposition  techniques,  we  demonstrate  how  the  fundamental  optimization 
problem  of  finding  a  maximum  size  fc-club  can  be  solved  optimally  on  large-scale  benchmark 
networks  that  are  available  in  the  public  domain.  Our  approach  circumvents  the  use  of 
complicated  formulations  of  the  maximum  fc-club  problem  in  favor  of  a  simple  relaxation 
based  on  necessary  conditions,  combined  with  canonical  hypercube  cuts  introduced  by  Balas 
and  Jeroslow. 

13.  J.  Pattillo,  Y.  Wang,  and  S.  Butenko.  Approximating  2-cliques  in  unit  disk  graphs.  Discrete 
Applied  Mathematics  166  (2014),  178-187. 

Abstract 

This  paper  studies  distance-based  clique  relaxations  in  unit  disk  graphs  arising  in  wire¬ 
less  networking  applications.  Namely,  a  2-clique  is  a  subset  of  nodes  with  pairwise  distance 
at  most  two  in  the  graph,  and  a  2-club  is  a  subset  of  nodes  inducing  a  subgraph  of  diameter 
two.  It  is  shown  that  in  a  unit  disk  graph  any  2-clique  is  4-dominated  and  any  2-club  is 
3-dominated.  The  former  observation  is  used  to  develop  a  ,) -approximation  algorithm  for 
the  maximum  2-clique  problem  in  unit  disk  graphs.  Moreover,  this  also  implies  polynomial 
solvability  of  the  minimum  dominating  set  problem  in  unit  disk  graphs  of  diameter  two, 
whereas  the  same  problem  is  shown  to  be  hard  in  general  diameter-two  graphs.  The  paper 
also  poses  several  related  open  questions  of  interest. 

14.  A.  Buchanan,  J.  Sung,  V.  Boginski,  and  S.  Butenko.  On  connected  dominating  sets  of 
restricted  diameter.  European  Journal  of  Operational  Research  236  (2014),  410-418. 

Abstract 

A  connected  dominating  set  (CDS)  is  commonly  used  to  model  a  virtual  backbone 
of  a  wireless  network.  To  bound  the  distance  that  information  must  travel  through  the 
network,  we  explicitly  restrict  the  diameter  of  a  CDS  to  be  no  more  than  s  leading  to 
the  concept  of  a  dominating  s-club.  We  prove  that  for  any  fixed  positive  integer  s  it  is 
NP-complete  to  determine  if  a  graph  has  a  dominating  s-club,  even  when  the  graph  has 
diameter  s  +  1.  As  a  special  case  it  is  NP-complete  to  determine  if  a  graph  of  diameter  two 
has  a  dominating  clique.  We  then  propose  a  compact  integer  programming  formulation 
for  the  related  minimization  problem,  enhance  the  approach  with  variable  fixing  rules  and 
valid  inequalities,  and  present  computational  results. 

15.  S.  Kahruman-Anderoglu,  A.  Buchanan,  S.  Butenko,  and  O.  Prokopyev.  On  provably  best 
construction  heuristics  for  hard  combinatorial  optimization  problems.  Networks  (2015).  To 
appear,  DOI  10. 1002/net. 21620. 


Abstract 

In  this  paper,  a  heuristic  is  said  to  be  provably  best  if,  assuming  V  ^  AfV,  no  other 
heuristic  always  finds  a  better  solution  (when  one  exists).  This  extends  the  usual  notion 
of  “best  possible”  approximation  algorithms  to  include  a  larger  class  of  heuristics.  We 
illustrate  the  idea  on  several  problems  that  are  somewhat  stylized  versions  of  real-life  net¬ 
work  optimization  problems,  including  the  maximum  clique,  maximum  fc-club,  minimum 
(connected)  dominating  set,  and  minimum  vertex  coloring  problems.  The  correspond¬ 
ing  provably  best  construction  heuristics  resemble  those  commonly  used  within  popular 
metaheuristics.  Along  the  way,  we  show  that  it  is  hard  to  recognize  whether  the  clique 
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number  and  the  fc-club  number  of  a  graph  are  equal,  yet  a  polytime  computable  function 
is  “sandwiched”  between  them.  This  is  similar  to  the  celebrated  Lovasz  function  wherein 
an  efficiently  computable  function  lies  between  two  graph  invariants  that  are  ATP-hard  to 
compute. 

16.  Z.  Ertem,  A.  Veremyev,  and  S.  Butenko.  Detecting  large  cohesive  subgroups  with  high  clus¬ 
tering  coefficients  in  social  networks.  Under  review  at  Social  Networks,  submitted  December 
2014. 


Abstract 

Clique  relaxations  are  used  in  classical  models  of  cohesive  subgroups  in  social  network 
analysis.  Clustering  coefficient  was  introduced  more  recently  as  a  structural  feature  charac¬ 
terizing  small- world  networks.  Noting  the  similarities  of  the  concepts  of  cohesive  subgroups 
and  small-world  networks,  this  paper  introduces  a  new  clique  relaxation,  a-cluster,  defined 
by  enforcing  a  lower  bound  a  on  the  clustering  coefficient  in  the  corresponding  induced 
subgraph.  Two  variations  of  the  clustering  coefficient  are  considered,  namely,  the  local 
and  global  clustering  coefficient.  Certain  structural  properties  of  a-clusters  are  analyzed 
and  mathematical  optimization  models  for  determining  a-clusters  of  the  largest  size  in  a 
network  are  developed  and  validated  using  several  real-life  social  network  instances.  In  ad¬ 
dition,  a  network  clustering  algorithm  based  on  local  a-clusters  is  proposed  and  successfully 
tested. 

17.  Y.  Wang,  A.  Buchanan,  and  S.  Butenko.  On  imposing  connectivity  constraints  in  integer 
programs.  Submitted,  June  2015.  Available  online  at 
http://www.optimization-online.org/DB_HTML/2015/02/4768.html 

Abstract 

In  many  network  applications,  one  searches  for  a  connected  subset  of  vertices  that 
exhibits  other  desirable  properties.  To  this  end,  this  paper  studies  the  connected  subgraph 
polytope  of  a  graph,  which  is  the  convex  hull  of  subsets  of  vertices  that  induce  a  connected 
subgraph.  Much  of  our  work  is  devoted  to  the  study  of  two  nontrivial  classes  of  valid 
inequalities.  The  first  are  the  a ,  6-separator  inequalities,  which  have  been  successfully  used 
to  enforce  connectivity  in  previous  computational  studies.  The  second  are  the  indegree 
inequalities,  which  have  previously  been  shown  to  induce  all  nontrivial  facets  for  trees.  We 
determine  the  precise  conditions  under  which  these  inequalities  induce  facets  and  when  each 
class  fully  describes  the  connected  subgraph  polytope.  Both  classes  of  inequalities  can  be 
separated  in  polynomial  time  and  admit  compact  extended  formulations.  However,  while 
the  a,  6-separator  inequalities  can  be  lifted  in  linear  time,  it  is  NP-hard  to  lift  the  indegree 
inequalities. 

18.  A.  Veremyev,  O.A.  Prokopyev,  V.  Boginski,  and  E.L.  Pasiliao.  Finding  maximum  subgraphs 
with  relatively  large  vertex  connectivity.  European  Journal  of  Operational  Research  239  (2014) 
349-362. 


Abstract 

We  consider  a  clique  relaxation  model  based  on  the  concept  of  relative  vertex  connec¬ 
tivity.  It  extends  the  classical  definition  of  a  fc-vertex-connected  subgraph  by  requiring  that 
the  minimum  number  of  vertices  whose  removal  results  in  a  disconnected  (or  a  trivial)  graph 
is  proportional  to  the  size  of  this  subgraph,  rather  than  fixed  at  k.  Consequently,  we  further 
generalize  the  proposed  approach  to  require  vertex-connectivity  of  a  subgraph  to  be  some 
function  f  of  its  size.  We  discuss  connections  of  the  proposed  models  with  other  clique  relax¬ 
ation  ideas  from  the  literature  and  demonstrate  that  our  generalized  framework,  referred  to 
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as  f-vertex-connectivity,  encompasses  other  known  vertex-connectivity-based  models,  such 
as  s-bundle  and  /c-block.  We  study  related  computational  complexity  issues  and  show  that 
finding  maximum  subgraphs  with  relatively  large  vertex  connectivity  is  NP-hard.  An  in¬ 
teresting  special  case  that  extends  the  i?-robust  2-club  model  recently  introduced  in  the 
literature,  is  also  considered.  In  terms  of  solution  techniques,  we  first  develop  general  lin¬ 
ear  mixed  integer  programming  (MIP)  formulations.  Then  we  describe  an  effective  exact 
algorithm  that  iteratively  solves  a  series  of  simpler  MIPs,  along  with  some  enhancements, 
in  order  to  obtain  an  optimal  solution  for  the  original  problem.  Finally,  we  perform  com¬ 
putational  experiments  on  several  classes  of  random  and  real-life  networks  to  demonstrate 
performance  of  the  developed  solution  approaches  and  illustrate  some  properties  of  the 
proposed  clique  relaxation  models. 

19.  A.  Buchanan,  J.  Sung,  S.  Butenko,  and  E.  L.  Pasiliao.  An  integer  programming  approach 
for  fault-tolerant  connected  dominating  sets.  INFORMS  Journal  on  Computing  27  (2015), 
178-188. 


Abstract 

This  paper  considers  the  minimum  fc-connected  d-dominating  set  problem,  which  is  a 
fault-tolerant  generalization  of  the  minimum  connected  dominating  set  (MCDS)  problem. 
Three  integer  programming  formulations  based  on  vertex-cuts  are  proposed  (depending  on 
whether  d  <  k,  d  =  k ,  or  d  >  k)  and  their  integer  hulls  are  studied.  The  separation  problem 
for  the  vertex-cut  inequalities  is  a  weighted  vertex-connectivity  problem  and  is  polytime 
solvable,  meaning  that  the  LP  relaxation  can  be  solved  in  polynomial  time  despite  having 
exponentially  many  constraints.  A  new  class  of  valid  inequalities  -  r-robust  vertex-cut 
inequalities  -  is  introduced  and  is  shown  to  induce  exponentially  many  facets.  Finally,  a 
lazy-constraint  approach  is  shown  to  compare  favorably  with  existing  approaches  for  the 
MCDS  problem  (the  case  k  =  d  =  1),  and  is  in  fact  the  fastest  in  literature  for  standard 
test  instances.  A  key  subroutine  is  an  algorithm  for  finding  an  inclusion-wise  minimal 
vertex-cut  in  linear  time.  Computational  results  for  k  =  d  =  2,3,4  are  provided  as  well. 

20.  C.  Balasubramaniam  and  S.  Butenko.  On  robust  clusters  of  minimum  cardinality  in  networks. 
Annals  of  Operations  Research  (2015).  To  appear,  DOI  10.1007/sl0479-015-1992-4. 

Abstract 

This  paper  studies  two  clique  relaxation  models,  fc-blocks  and  /c-robust  2-clubs,  used 
to  describe  structurally  cohesive  clusters  with  good  robustness  and  reachability  properties. 

The  minimization  version  of  the  two  problems  are  shown  to  be  hard  to  approximate  for  k  >  3 
and  k  >  4,  respectively.  Integer  programming  formulations  are  proposed  and  a  polyheral 
study  is  presented.  The  results  of  sample  numerical  experiments  on  several  graph  instances 
are  also  reported. 

21.  C.  Balasubramaniam,  S.  Butenko,  and  B.  Balasundaram.  On  upper  bounds  for  the  maximum 
clique  problem.  Submitted,  October  2015. 

Abstract 

In  this  paper,  we  develop  strong  and  easily  computable  upper  bounds  for  the  maximum 
clique  problem  that  take  advantage  of  continuous  relaxations  of  integer  programming  for¬ 
mulations  for  the  minimum  k- core  problem  and  its  diameter-restricted  version.  We  show 
that  the  proposed  upper  bounds  are  better  than  those  obtained  from  the  standard  relax¬ 
ations  of  the  clique  polytope.  We  also  report  the  results  of  computational  experiments  with 
several  classes  of  graph  instances  and  provide  a  comparison  with  some  existing  bounds  from 
the  literature. 
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22.  J.  Ma  and  B.  Balasundaram.  On  the  chance-constrained  minimum  spanning  k- core  problem. 
In  preparation.  Anticipated  submission,  December  2015. 

Abstract 

A  graph  in  which  the  degree  of  every  vertex  is  at  least  k  is  called  a  k- core.  If  the 
parameter  k  is  sufficiently  large  relative  to  the  number  of  vertices,  a  k-core  is  guaranteed 
to  possess  2-hop  reachability  between  all  pairs  of  vertices.  Furthermore,  it  is  guaranteed 
to  preserve  pairwise  2-hop  distances  under  single  vertex  deletion.  Hence,  the  notion  of 
a  k- core  can  be  used  to  produce  2-liop  survivable  network  designs,  specifically  to  design 
inter-hub  networks.  Formally,  given  an  edge-weighted  graph,  the  minimum  spanning  k- core 
problem  seeks  to  find  a  spanning  subgraph  of  the  given  graph  with  minimum  total  edge 
weight  that  is  also  a  k-core.  For  any  fixed  k ,  this  problem  is  equivalent  to  a  generalized 
graph  matching  problem,  and  can  be  solved  in  polynomial  time.  This  article  focuses  on 
a  chance  constrained  version  of  the  minimum  spanning  k- core  problem  under  probabilis¬ 
tic  edge  failures.  We  first  establish  the  complexity  of  this  problem  by  proving  that  it  is 
NP-lrard.  Subsequently,  we  conduct  a  polyhedral  study  to  identify  faceting  inducing  in¬ 
equalities,  and  then  develop  a  strengthened  formulation.  The  quality  of  bounds  produced 
by  the  strengthened  formulation,  and  its  impact  on  solving  this  problem  is  demonstrated 
through  a  computational  study. 

23.  O.  Yezerska,  S.  Butenko,  and  V.  Boginski.  Detecting  robust  cliques  in  graphs  subject  to 
uncertain  edge  failures.  Under  review  at  Annals  of  Operations  Research,  submitted  June 
2015. 


Abstract 

This  paper  develops  and  compares  several  heuristic  approaches,  as  well  as  an  exact  com¬ 
binatorial  branch-and-bound  algorithm,  for  detecting  maximum  robust  cliques  in  graphs 
subjected  to  multiple  uncertain  edge  failures.  The  desired  robustness  properties  are  en¬ 
forced  using  Conditional  Value-at-Risk  (CVaR)  risk  measure.  The  proposed  heuristics  are 
adaptations  of  the  well-known  tabu  search  and  GRASP  methods,  whereas  the  exact  ap¬ 
proach  is  an  extension  of  Ostergard’s  algorithm  for  the  maximum  clique  problem.  The 
results  of  computational  experiments  on  DIMACS  graph  instances  are  reported. 

24.  F.  Mahdavi  Pajouh,  E.  Moradi,  and  B.  Balasundaram.  Detecting  large  risk-averse  2-clubs  in 
graphs  with  random  edge  failures.  Under  review  at  Annals  of  Operations  Research ,  submitted 
December  2014. 


Abstract 

Detecting  large  2-clubs  in  biological,  social  and  financial  networks  can  help  reveal  im¬ 
portant  information  about  the  structure  of  the  underlying  systems.  In  large-scale  networks 
that  are  error-prone,  the  uncertainty  associated  with  the  existence  of  an  edge  between  two 
vertices  can  be  modeled  by  assigning  a  failure  probability  to  that  edge.  Here,  we  study 
the  problem  of  detecting  large  “risk-averse”  2-clubs  in  graphs  subject  to  probabilistic  edge 
failures.  To  achieve  risk  aversion,  we  first  model  the  loss  in  2-club  property  due  to  proba¬ 
bilistic  edge  failures  as  a  function  of  the  decision  (chosen  2-club  cluster)  and  randomness 
(graph  structure).  Then,  we  utilize  the  conditional  value-at-risk  (CVaR)  of  the  loss  for  a 
given  decision  as  a  quantitative  measure  of  risk  for  that  decision,  which  is  bounded  in  the 
model.  More  precisely,  the  problem  is  modeled  as  a  CVaR-constrained  single-stage  stochas¬ 
tic  program.  The  main  contribution  of  this  article  is  a  new  decomposition  algorithm  based 
on  a  Benders  decomposition  scheme,  which  outperforms  an  algorithm  based  on  an  existing 
decomposition  idea,  on  a  test-bed  of  randomly  generated  instances,  and  real-life  biological 
and  social  networks. 
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